Tag

#language model

24 articles

German AI consortium releases Soofi S, an open 30B model that tops benchmarks in both English and German

A German AI consortium has released Soofi S 30B-A3B, an open language model that outperforms existing competitors on both English and German benchmarks.

Jul 1335

OpenAI staffer maps out which of GPT-5.6 Sol's five reasoning levels fits which task complexity

OpenAI staffer Vaibhav Srivastav outlines how GPT-5.6 Sol's five reasoning levels align with task complexity, recommending a gradual scaling approach for optimal performance.

Jul 1036

tech

Meet Nemotron Labs 3 Puzzle 75B A9B: A Compressed Hybrid MoE LLM Delivering 2.03x Server Throughput

NVIDIA introduces Nemotron-Labs-3-Puzzle-75B-A9B, a compressed hybrid MoE LLM delivering 2.03x server throughput, leveraging hardware-aware compression and knowledge distillation.

Jul 934

NVIDIA Releases Nemotron-Labs-3-Puzzle-75B-A9B: A Compressed Hybrid MoE LLM Delivering 2.03x Server Throughput at Matched User Throughput

Learn how NVIDIA's new AI model Nemotron-Labs-3-Puzzle-75B-A9B uses compression and smart design to work faster and more efficiently than previous versions, without sacrificing quality.

Jul 839

MiniMax is building China’s biggest AI model yet, and plans to open-source it

MiniMax is set to release a 2.7-trillion-parameter AI model that will be open-sourced, marking a major step in China’s push to dominate the global AI landscape.

Jul 839

NVIDIA Releases Audex (Nemotron-Labs-Audex-30B-A3B): A Unified Audio-Text LLM That Preserves the Text Intelligence of Its Backbone

Learn how NVIDIA's new AI system Audex combines audio and text processing in one powerful model, preserving text intelligence while adding speech capabilities.

Jul 722

Tencent releases Hy3 open-source model that allegedly matches models up to five times its active size

Learn how to work with mixture-of-experts (MoE) language models like Tencent's Hy3 using Hugging Face's Transformers library. This beginner-friendly tutorial teaches you to load, tokenize, and generate text with MoE models.

Jul 630

I had Gemini and Claude write my email replies - but only one sounds like me

This article explains how language model fine-tuning and personalization work in AI systems, using the comparison between Gemini and Claude's email reply generation as a practical example.

Jul 123

NVIDIA Releases Nemotron-Labs-TwoTower: an Open-Weight Diffusion Language Model Built on a Frozen Autoregressive Nemotron-3-Nano-30B-A3B Backbone

Learn to work with NVIDIA's Nemotron-Labs-TwoTower, a hybrid language model combining autoregressive and diffusion approaches for improved text generation throughput.

Jun 3042

ByteDance's "iLLaDA" is a diffusion language model that keeps up with Qwen2.5

This article explains the technical foundations of diffusion language models, how ByteDance's iLLaDA works, and why this new approach may challenge traditional autoregressive models.

Jun 2646

tools

GLM-5.2 OpenAI-Compatible API: A Hands-On Guide to Reasoning Effort, Function Calling, and Long-Context Retrieval

GLM-5.2's OpenAI-compatible API offers a streamlined way to integrate advanced AI features like reasoning effort control, function calling, and long-context retrieval into applications.

Jun 2240

Zhipu AI's GLM-5.2 closes in on closed-source leaders in coding marathons

Zhipu AI's GLM-5.2 closes in on closed-source leaders in coding benchmarks, trailing only Anthropic's Claude Opus 4.8 by one percentage point in the FrontierSWE test.

Jun 1740